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O , ABSTRACT 

We study the spatial clustering of 538 X-ray selected AGN in the 2 deg- XMM-COSMOS field that are spectroscopically identified 
Kji \ with Iab < 23 and span the redshift range z = 0.2 - 3.0. The median redshift and luminosity of the sample are z = 0.98 and Lo.5-10 = 

I^N , 6.3 X 10*^^ erg s"', respectively. A strong clustering signal is detected at ~ 18cr level, which is the most significant measurement 

■ obtained to date for clustering of X-ray selected AGN. By fitting the projected correlation function w(rp) with a power law on scales 

of fp = 0.3 - 40 /j"' Mpc, we derive a best-fit comoving correlation length of = 8.6 + 0.5 h^^ Mpc and slope of 7 = 1.88 ± 0.07 
(Poissonian errors; bootstrap errors are about a factor of 2 larger). An excess signal is observed in the range rp ~ 5 - 15 /r' Mpc, 
which is due to a large-scale structure at z ~ 0.36 containing about 40 AGN, a feature which is evident over many wavelengths in 
the COSMOS field. When removing the z ~ 0.36 structure or computing w{rp) in a narrower range around the peak of the redshift 
distribution (e.g. z = 0.4 - 1.6), the correlation length decreases to ro ~ 5 - 6h^' Mpc, which is consistent with what is observed for 
bright optical QSOs at the same redshift. 

We investigate the clustering properties of obscured and unobscured AGN separately, adopting different definitions for the source 
obscuration. For the first time, we are able to provide a significant measurement for the spatial clustering of obscured AGN at z ~ 1 . 
Within the statistical uncertainties, we do not find evidence that AGN with broad optical lines (BLAGN) cluster differently from 
AGN without broad optical lines (non-BLAGN). Based on these results, which are limited by object statistics, however, obscured and 
unobscured AGN are consistent with inhabiting similar environments. 

The evolution of AGN clustering with redshift is also investigated. No significant difference is found between the clustering properties 
of XMM-COSMOS AGN at redshifts below or above z = 1. 

The correlation length measured for XMM-COSMOS AGN at z ~ 1 is similar to that of massive galaxies (stellar mass > 
3 X 10'° Mo) at the same redshift. This suggests that AGN at z ~ 1 are preferentially hosted by massive galaxies, as observed both in 
the local and in the distant (z ~ 2) Universe. According to a simple clustering evolution scenario, we find that the relics of AGN are 
expected to have a correlation length as large as ro ~ 8 Mpc by z = 0, and hence to be hosted by local bright (L ~ L^) ellipticals. 
We make use of dark matter halo catalogs from the Millennium simulation to determine the typical halo hosting moderately luminous 
z ~ 1 AGN. We find that XMM-COSMOS AGN live in halos with masses M > 2.5 x 10'^ Mq . By combining the number density 
of XMM-COSMOS AGN to that of the hosting dark matter halos we estimate the AGN duty cycle and lifetimes. We find lifetimes 
approximately of 1 Gyr for AGN at z ~ 1, which are longer than those estimated for optically bright QSOs at the same redshift. These 
longer lifetimes mainly reflect the higher number density of AGN selected by X-ray samples. 
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1. Introduction 

Several pieces of evidence point towards an intimate correlation 
between the evolution of galaxies and the accretion and growth 
of supermassive black holes (SMBHs) at their centers, indicat- 
ing that most galaxies in the Universe spent a fraction of their 
lifetimes as active galactic nuclei (AGN). In the local Universe, 
most galaxy bulges indeed host a supermassive black hole (see 
e.g. Ferrarese & Ford, 2005' for a review), whose mass scales 
with the bulge mass and stellar velocity dispersion (Ferrarese 
& Men-itt 2000, Gebhart et al. 2000). Furthermore, the growth 
of SMBHs during active accretion phases, which is traced by the 
cosmological evolution of the AGN luminosity function (Ueda et 
al. 2003, Hasinger et al. 2005, La Franca et al. 2005, Silverman 
et al. 2008a), has been shown to eventually match the mass func- 
tion of SMBHs in the local Universe (e.g. Marconi et al. 120041 
Yu & TremaineHOOl Shankar et al. 2004). 

While the SMBH vs galaxy co-evolution is now an ac- 
cepted scenario, the details of this joint evolution are not fully 
understood yet. Nuclear activity in bright QSOs is thought 
to be induced by major mergers or close encounters of gas- 
rich galaxies in the context of hierarchical structure formation 
(e.g. Kauffmann & Haehnelt 2000, CavaUere & Vittorini 2002, 
Hopkins et al. 2006). Alternatively, nuclear activity can be sim- 
ply related to the physical processes (e.g. star formation) going 
on in a single galaxy, without being induced by mergers or in- 
teractions with neighboring objects (e.g. Granato et al. 2004). 
Overall, the role played by the environment in triggering both 
nuclear activity and star formation is still a matter of debate. 

Just like local ultraluminous infrared galaxies (Sanders & 
Mirabel 1996), the population of bright submillimeter sources 
at z ~ 2 (Chapman et al. 2003) is hosting both star formation 
and nuclear activity (Alexander et al. 2005), as a result of galaxy 
interactions (Tacconi et al. 2008). However, the majority of z ~ 2 
AGN selected at faint X-ray fluxes seem to be hosted by galaxies 
with a spectral energy distribution typical of passively evolving 
objects (Mainieri et al. 2005). The concurrent growth of black 
holes and stellar mass has been observed in IR galaxies at z ~ 2 
by Daddi et al. (2007), who suggested a long-lived (> 0.2 Gyr) 
AGN plus starburst phenomenon, unlikely to be triggered by 
rapid merger events. In the local galaxies observed by the Sloan 
Digital Sky Survey (SDSS, York et al. 2000), nuclear activity 
does not appear to be correlated to the presence of close com- 
panions, while star formation does (Li et al. 2008). A common 
merger origin for both phenomena cannot be ruled out, however, 
provided they occur at different times (see Li et al. 2008). 

The relation between nuclear activity and the environment 
can be studied via clustering techniques in the context of large- 
scale structure formation, in which the growth of baryonic struc- 
tures is supposed to follow the formations of dark matter halos 
(DMHs). 

The comparison between the clustering properties of AGN 
and those of DMHs predicted by cold dark matter (CDM) mod- 
els can be used to evaluate the typical mass of the DMHs in 
which AGN form and reside as a function of cosmic time. The 
most recent measurements have shown that bright QSOs in the 
redshift range 0-3 reside into DMHs of mass M > 10 '^ Mq 
(Grazian et al. '2004'; Porciani et al. 2004, Croom et al. l2005l 
but see Padmanabhan et al. 2008 for lower mass estimates at 
z ~ 0.3). In addition, the ratio between the AGN space density 
and the space density of host DMHs may provide an estimate of 
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the AGN lifetime (e.g. Martini & Weinberg 2001). Current esti- 
mates are largely uncertain, constraining the AGN lifetime in the 
range of a few xlO^ - 10^ yr (Grazian et al. 120041 Porciani et al. 
2004). Finally, the comparison between the clustering properties 
of different galaxy types and AGN can be used to estimate AGN 
hosts and to estimate the descendant and progenitors of AGN at 
any given redshifts. 

AGN clustering has been traditionally studied by means of 
the two-point correlation function applied to optically selected 
QSO samples (e.g. Shanks et al. 1987. La Franca et al. ,1998] l. 
The most recent and solid results of these analyses come from 
the two largest QSO surveys to date, namely the 2dF QSO 
Redshift Survey (2QZ, e.g. Croom et al. l2005l l. and the SDSS. 
The 2QZ is based on a sample of more than 20000 objects 
with redshifts 0.2 < z < 3.0. When calculating the correla- 
tion function in real space and approximating it with a power 
law ^(r) = (flro)^^, the QSO correlation length and slope were 
found to be ro - 5.0 + 0.5 /i"' Mpc and y = 1.8 5 + 0.13 at a me- 
dian redshift of z = 1 .5 (Da Angela et al. l2005l l. Some evidence 
of a flattening towards smaller scales was also reported, with 
y — 1.45 at projected scales below 10 /i"' Mpc (Da Angela et 
al. l2005l l. The clustering level of 2QZ QSOs is similar to that of 
early type galaxies at the same redshift (Coil et al. 2004, Meneux 
et al. 2006), suggesting they reside in environments of similar 
density. A tentative detection of z ~ 1 AGN residing preferen- 
tially in the same environment of blue rather than red galaxies 
has been reported by Coil et al. (2007). The QSO clustering is 
observed to be a strong function of redshift (Croom et al. 2005, 
Porciani & Norberg 2006), with the correlation length of lumi- 
nous QSOs at z ~ 4 being as high as ro - 24 h^^ Mpc (Shen et 
al. 2007). This suggests that luminous, early QSOs are hosted by 
the most massive and rare DMHs and hence form in the highest 
density peaks of the dark matter distribution. The evidence of lu- 
minosity dependent clustering is, on the contrary, still marginal 
(Porciani & Norberg 2006). 

The above results are mostly based on AGN selected by 
means of their blue optical colors and broad optical lines; i.e., 
they essentially refer to unobscured, type 1 AGN. With the no- 
table exception of the measurement performed by the SDSS on 
a local sample of narrow-line AGN (Li et al. 2006), to date 
there has been no information on the clustering properties of 
obscured AGN, which, based on the results from deep X-ray 
surveys (Brandt & Hasinger 2005, Tozzi et al. 2006) and X-ray 
background synthesis models (e.g. Gilli, Comastri & Hasinger 
2007a), are found to be a factor of > 4 more numerous than 
unobscured ones; i.e., they are the most abundant AGN popula- 
tion in the Universe, dominating the history of accretion onto 
SMBHs (e.g. Fabian 1999). If the unified model strictly ap- 
plies, i.e. the nuclear obscuration is just an orientation eff'ect, one 
should not expect diff'erences in the clustering properties of ob- 
scured and unobscured AGN. However, several exceptions to the 
strict unified model are known. Source obscuration is in many 
cases related to the gas content and evolutionary stage of the host 
galaxy, rather than to a small-scale torus intercepting the line of 
sight (Malkan et al. 1998). Models have been proposed in which 
the onset of nuclear activity starts embedded in an envelope of 
gas and dust, which is later on swept out by the QSO radiation 
(see e.g. Hopkins et al. 2006). If this were the case, obscured and 
unobscured AGN would be just two subsequent stages along a 
galaxy lifetime. The diff'erent durations of these two stages and 
their relation with the environment may produce different clus- 
tering properties between obscured and unobscured AGN. 

One obvious way to obtain samples of obscured AGN is 
through X-ray observations. Besides reducing the obscuration 
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bias dramatically, especially in the hard 2-10 keV band. X-ray 
selection also has the advantage of being effective in selecting 
distant low-luminosity AGN, whose optical light is diluted by 
the host galaxy emission and therefore missed by color-based 
optical surveys. 

In the past years the limited sample size of X-ray selected 
AGN prevented clustering analyses as detailed as for optically 
selected objects. In particular, the lack of dedicated optical 
follow-up programs of X-ray sources providing large samples 
with spectroscopic measurements, has not allowed accurate es- 
timates of the spatial clustering of X-ray selected AGN, limit- 
ing most studies to angular clustering. Numerous investigations 
of the two point angular correlation function of X-ray sources 
have indeed appeared in the literature, but the results suffer 
from rather large uncertainties. Early attempts to measure the 
angular clustering of X-ray selected sources were performed by 
VikhHnin et al. (1995) and Carrera et al. (1998) based on ROSAT 
pointings. More recent results based on Chandra and XMM data 
have been obtained by Basilakos et al. (2004), Gandhi et al. 
(2006), Puccetti et al. (2006), Miyaji et al. (2007), Carrera et 
al. (2007), Plionis et al. (2008), and Ueda et al. (2008). In par- 
ticular, Miyaji et al (2007) and Gandhi et al. (2006) have com- 
puted the angular correlation function over contiguous areas of a 
few square degrees (the 2 deg^ COSMOS field and the ~4 deg^ 
XMM-LSS field, respectively), which should reduce the impact 
of cosmic variance. In the COSMOS field, Miyaji et al. (2007) 
measured a correlation length of about 10+1 Mpc, while 
only a loose constraint (ro = 3 - 9 /i"') Mpc was obtained in the 
XMM-LSS by Gandhi et al. (2006). Very recently an attempt to 
measure the angular clustering of high-redshift (z ~ 3), X-ray 
selected AGN has been done by Francke et al. (2008). 

The few examples of spatial clustering of X-ray selected 
sources appeared in the literature are limited by low statistics. 
Based on a sample of ~ 220 QSOs at z ~ 0.2 found in the 
80deg2 ROSAT North Ecliptic Pole survey (NEP, Gioia et al. 
I2003I I. Mullis et al. (2004) were able to measure a correlation 
signal to > 3cr level. By fixing the correlation slope to y = 1.8, 
they found a best-fit correlation length of ro - 7.4;^] ^ /i"' Mpc. 
Because of the relatively short exposures in the NEP survey 
and the limited ROSAT sensitivity and bandpass (0. 1 -2.4 keV), 
only bright, luminous, unobscured QSOs have been detected in 
this sample (median Lo.5-2keV ~ 9 x lO'*-' erg s"', correspond- 
ing to Lo.5-iokeV ~ 2 X 10"*"^ erg s"' for a spectral slope of 
a = 0.9). Later, by analyzing data from the Chandra Deep 
Field South (CDFS, Rosati et al. 2002) and Chandra Deep Field 
North (CDFN, Alexander et al. 2005 ), Gilli et al. 2005 were 
able to detect clustering at > 7cr level for z ~ 0.7 AGN with 
i'O.s-iokeV ~ lO'*^ erg s"' . However, the best-fit correlation length 
was found to vary by a factor of ~ 2 between the two fields 
(ro = 10 h-'- Mpc in the CDFS, ro = 5/1' Mpc in the CDFN), 
revealing strong cosmic variance over these small, 0. 1 deg^ each, 
sky areas. Although with limited significance (~ 3cr), in the 
CDFs, it was also possible to determine the clustering proper- 
ties of obscured AGN only, which did not show significant dif- 
ferences with respect to those of unobscured ones within the un- 
certainties (Gilli et al. ,2005 ). The most recent measurement is 
the one performed in the larger, 0.4 deg^ field covered by the 
CLASXS (Yang et al. 2006). A coiTelation length of ~ 5.7/1"' 
Mpc was found for X-ray selected AGN at z ~ 1 .2, with average 
luminosity of Lo.5-iokeV ~ 6 x 10"*^ erg s"'. 

A large number of X-ray surveys are ongoing and are ex- 
pected to provide larger samples of sources over wide sky ar- 
eas and with different limiting fluxes. They will allow studies of 
clustering of AGN in diff'erent redshift and luminosity regimes. 
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Fig. 1. Distribution on the sky of the 1822 pointlike sources 
detected by XMM (crosses). Spectroscopically identified ob- 
jects are shown as filled circles. The area covered by spectro- 
scopic observations is also shown: the large circles show the 16 
Magellan IMACS pointings, while the polygon represents the 
area presently covered by zCOSMOS observations. 



A few examples of such surveys are X-Bootes (Murray et al. 
2005), XMM-LSS (Pierre et al. 2007), Extended CDFS (Lehmer 
et al 2005), AEGIS (Nandra et al. 2005), and XMM-COSMOS 
(Hasinger et al. 2007). One of these samples, the XMM survey in 
the 2 deg2 COSMOS field (XMM-COSMOS), has been specif- 
ically designed to study with the best statistics the clustering of 
X-ray selected AGN. One of the main goals of XMM-COSMOS 
was indeed to provide the best measurement to date of the corre- 
lation function of X-ray selected AGN and to allow a reliable 
measurement of the correlation function of obscured AGN at 
z > for the first time. 

The optical spectroscopic identification of the 1822 point- 
like X-ray sources detected by XMM-COSMOS continues. In 
this paper we present the results based on the first third of the 
objects. The paper is organized as follows. In Sect. 2 we summa- 
rize the X-ray and optical follow-up observations of the XMM- 
COSMOS sample and present the source catalog used in our 
analysis. In Sect. 3 we describe the methods of estimating the 
correlation function of X-ray selected sources. In Sect. 4 several 
safety checks are performed to validate the adopted techniques. 
The results of our analysis are presented in Sect. 5. In Sect. 6 
the results are discussed and interpreted. The conclusions and 
prospects for future work are finally presented in Sect. 7. 

Throughout this paper, a flat cosmology with f2„, = 0.25 and 
Q.A = 0.75 is assumed (Spergel et al. I2007I I. For comparison 
with previous measurements we refer to correlation lengths and 
distances in units of /;"' Mpc comoving, where Ho - 100 /i km 
s ' Mpc Masses of dark matter halos are also expressed in 
units of /;"' Mq for consistency with the Millennium simulation, 
and AGN and halo space densities are expressed in units of li^ 
Mpc AGN luminosities and lifetimes are calculated using /; = 
0.7. 
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2. The sample 

The XMM-COSMOS survey is part of the COSMOS legacy 
project (see Scoville et al. 2007a for an overview of the sur- 
vey), an extensive multiwavelength campaign to observe a 
1.4 X 1.4 deg equatorial field centered at (RA, DEC)j2m) - 
(150.1083,2.210). A number of large programs with the major 
observing facilities have been already performed or are ongoing, 
including HST (Scoville et al. 2007b), VLT (Lilly et al. 2007), 
SUBARU (Taniguchi et al. 2007), VLA (Schinnerer et al. 2007), 
XMM (Hasinger et al. 2007), Chandra (Elvis et al. in prepara- 
tion) Spitzer (Sanders et al. 2007), and GALEX (Zamojski et al. 
2007; Schiminovich et al. in preparation). 

XMM-COSMOS is a mosaic made of 53 partially overlap- 
ping XMM pointings that cover the entire 2 deg^ COSMOS field. 
The XMM observations were allocated across two announce- 
ments of opportunities (AO-4 and AO-5) and performed in two 
different passefl for a total of 1 .4 Ms exposure time. Each pass 
was arranged into a regular grid of ~ 30 ks pointings separated 
by 8 arcmin each to cover the full 2 deg^ field. In the second pass 
the grid pattern was shifted by 1 arcmin with respect to the first 
pass to ensure maximum uniformity in the sensitivity over the 
final mosaic. The limiting fluxes reached in the regions of maxi- 
mum exposure are 5 x 10""', 3 x 10"'^, 6 x 10"'^ erg cm"^ s"' 
in the 0.5-2, 2-10, and 5-10 keV, respectively, while the entire 2 
deg^ area is covered down to 2.4x10"'^, 1.5x10"''*, 2x10"'"* erg 
cm"^ s"' in the same bands. In total 1822 pointlike sources have 
been detected in at least one band down to a likelihood threshold 
of 10 (see Cappelluti et al. 2007 for the source detection pro- 
cess). The final catalog will be presented in a forthcoming paper 
(Cappelluti et al. in preparation). A number of results concern- 
ing the first pass (0.8 Msec total exposure) have been published 
by Cappelluti et al. (2007), Miyaji et al. (2007), and Mainieri et 
al. (2007). 

The optical identification of XMM sources is currently in 
progress (see Brusa et al. 2007 for the initial results based on the 
first 12 XMM pointings). Unique optical and/or infrared coun- 
terparts for most (~88%) of the XMM sources have now been 
recognized. Thanks to the Chandra observations in the central 
COSMOS square deg (Elvis et al. in prep), a number of formerly 
ambiguous identifications have now been made secure (Brusa et 
al. in preparation). The main dedicated spectroscopic follow-up 
programs of XMM sources are being conducted with the IMACS 
instrument at the 6m Magellan telescope (Trump et al. 2007) and 
with VIMOS at the VLT within the zCOSMOS program (e.g. 
Lilly et al. 2007). A number of spectroscopic redshifts were also 
obtained by cross-correlating the XMM catalog with published 
spectroscopic catalogs like the SDSS. About 46% of the total 
spectroscopic sample were obtained with only IMACS obser- 
vations, 25% with only zCOSMOS, and another 24% has been 
observed in both programs. The remaining 5% were obtained 
by cross-correlation with public catalogs. Quality flags were as- 
signed to the redshifts measured by IMACS and zCOSMOS. We 
considered here only the 62 1 X-ray pointlike sources with high- 
est quality flags, which have been identified as extragalactic ob- 
jects. By considering the 150 duplicated redshifts (i.e. those ob- 
jects observed by both IMACS and VIMOS) the accuracy in the 
redshift measurements is verified to be ct; < 0.002. 

Since we combine measurements obtained with different op- 
tical instruments by different groups, which adopt somewhat dif- 
ferent spectral classification criteria, it is not easy to obtain a uni- 
form classification scheme for the optical counterparts of XMM 

' We do not consider here two additional pointings performed sepa- 
rately in AO-6. 
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Fig. 2. Soft X-ray luminosity vs redshift relation for the spec- 
troscopically identified sources in XMM-COSMOS. Broad line 
AGN (BLAGN) and non broad Une AGN (non-BLAGN) are 
shown as blue open circles and red filled circles, respectively. 
Only objects in the redshift range z = 0.2 - 3 (vertical dashed 
lines) have been considered for the clustering analysis. 

sources. The only class with a common definition in the var- 
ious catalogs is that of objects showing broad emission lines 
(FWHM> 2000 km/s). Therefore, in the following, as far as opti- 
cal classification is concerned, we simply divide the sample into 
objects with and without broad optical lines, referring to them as 
broad line AGN (BLAGN) and non-BLAGN, respectively (see 
also Brusa et al. 2007). 

The 0.5-2 keV X-ray luminosity vs redshift distribution of 
BLAGN and non-BLAGN, is shown in Fig. |2] BLAGN are on 
average observed at higher redshift and at luminosities above 
2q42.5 gj.g g-i^ while non-BLAGN are observed down to very 
low X-ray luminosities and may therefore include a significant 
fraction of normal galaxies. To exclude from the sample those 
objects that are not likely to be AGN, we then considered only 
those sources at a redshift higher than 0.2. As shown in Fig. |2] 
this cut essentially removes most low-luminosity objects, leav- 
ing in the sample only objects with Lo.5-2 <: 10"*'^ erg s"'.0 
In addition we considered for our analysis only objects at red- 
shifts below 3, since beyond this limit the source density be- 
comes extremely low and the selection function is very uncertain 
(see Sect. 3). 

By exploiting the large multiwavelength database available 
in COSMOS, Salvato et al. (2008) are able to estimate a photo- 
metric redshift for ~ 85% of the XMM sources. For the remain- 
ing 15% of the sample, the main reason for not attempting a pho- 
tometric redshift estimate was the ambiguity in the correct asso- 
ciation with the optical/IR counterpart (see Brusa et al. 2007). 
This issue, however, is not expected to introduce any bias re- 
lated to source distances. Therefore the redshift distribution esti- 
mated using photometric redshifts should be very close to that of 

^ We verihed that adopting a luminosity cut at Lo.5-10 > lO*'-^ erg s"' 
produces similar results as adopting a redshift cut at z=0.2. 
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Fig. 3. I-band magnitude vs redshift distribution of the optical 
counterparts of XMM sources. Objects with spectroscopic red- 
shift or only photometric redshift are shown as filled circles 
or crosses, respectively. The dashed horizontal line shows the 
Iab - 23 magnitude limit considered for the clustering analysis 
(see text). 



the entire XMM sample. On the contrary, spectroscopic redshifts 
have been measured for a minority (~ 36%) of the total XMM 
sample (including 34 stars), making it possible that objects with 
measured spectroscopic redshift are not a fair representation (i.e. 
a random sampling) of the total AGN population detected by 
XMM. The I-band magnitude vs redshift distribution of the ob- 
jects with spectroscopic redshifts and only photometric redshifts 
is shown in Fig.|3] Objects only with photometric redshift are on 
average optically fainter and at higher redshift than objects with 
spectroscopic redshifts. A Kolmogorov-Smirnov test performed 
on the redshifts distribution of objects with and without spec- 
troscopic redshift indicates that they differ at > 3>cr level; i.e., 
the spectroscopic sample is not a fair representation of the en- 
tire AGN population detected by XMM. We therefore impose a 
magnitude cut at Iab < 23, which excludes only a small fraction 
(< 3%) of spectroscopically identified objects but increases the 
spectroscopic completeness to about 60%. The redshift distribu- 
tion of objects brighter than Iab - 23 with and without spec- 
troscopic redshifts are statistically indistinguishable. Therefore, 
we conclude that, for objects with Iab < 23, the spectroscopic 
selection does not include any bias against high-redshift objects. 

In the following we consider the sample of 538 XMM ob- 
jects with Iab < 23 and spectroscopic redshift in the range 
z - 0.2 - 3.0 as our reference sample. The average redshift 
and X-ray luminosity of this sample are z - 0.98 and L0.5-10 = 
6.3x10^^ erg/s, respectively. The redshift distribution of the sam- 
ple sources is shown in Fig. |4] A number of redshift structures 
are observed, the most prominent of which is at z ~ 0.36, also 
observed at other wavelengths in COSMOS (Lilly et al. 2007). 



Fig. 4. Redshift distribution for the 538 X-ray sources in the ref- 
erence sample in bins of Az = 0.01. The solid curve is obtained 
by smoothing the observed redshift distribution with a Gaussian 
with cr. = 0.3 and is used to generate the random control sample 
in the correlation function estimate. 

3. Analysis techniques 

The basic statistics commonly used to measure the clustering 
properties of galaxies is the two point correlation function ^(r), 
defined as the excess probability over random of finding a pair 
with one object in the volume dV\ and the second in the volume 
dVz, separated by a distance r (Peebles [l980l l: 

dP^n\l+4{r)]dVxdV2, (1) 

where n is the mean object space density. In our calculations 
we always refer to comoving distances and volumes. In a flat 
Universe one can simply estimate the comoving distance be- 
tween two objects at redshifts zi and zi separated on the sky 
by an angle 6 with the cosine rule (see e.g. Osmer et al. ll981l) : 



i = yt/l^ + - 2t/ic/2COS0, (2) 

where c/, is the radial comoving distance to object which, again 
in a flat universe, reads as 

= r dz[^,n{l+Z? +^Kr''^- (3) 

^0 Jo 

However, a well known feature of Eq. 2 in measuring pair 
separations is that it is affected by redshift space distortions since 
peculiar velocities combine with the source distance to produce 
the measured redshift. (This is the same effect which in redshift 
surveys transforms galaxy clusters into fingers-of-God). The net 
effect of peculiar velocities is to shift pairs from smaller to larger 
radial separations, thus shifting the clustering power towards 
higher scales. In particular a flatter slope with respect to the real 
space correlation function is generally observed. A similar effect 
is also produced by the uncertainties in the redshift measure- 
ments. 
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Fig. 5. Upper panel: projected correlation function w{rp) com- 
puted at different scales (see label) as a function of the integra- 
tion radius n,„ax- From top middle to bottom panels the correla- 
tion length ro, slope y, and amplitude are shown as a function 
of Umax, respectively. 



To overcome these problems one can resort to the so-called 
projected coiTelation function; 



r 



^(rp,7T)dn, 



(4) 



where ^(r^, n) is the two-point correlation function expressed in 
terms of the separations perpendicular (r^) and parallel (tt) to the 
line of sight as defined in Davis & Peebles ( 119831 ) and applied to 
comoving coordinates: 



An integration limit 7r,„„v has to be chosen in Eq.|4]to max- 
imize the correlation signal. Indeed, one should avoid n^ax val- 
ues that are too high, since they would mainly add noise to the 
estimate of w{rp). On the other hand, scales that are too small, 
comparable to tihe redshift uncertainties and to the pairwise ve- 
locity dispersion (i.e. the dispersion in the distribution of the rel- 
ative velocities of source pairs), should also be avoided since 
they would not allow the whole signal to be recovered. 

The typical uncertainty in the redshift measurements (cr, < 
0.002) corresponds to comoving scales below 6.0 h^^ Mpc at 
all redshifts. The pairwise velocity dispersion measured in the 
local Unive rse (500 - 600 km 5 Marzke et al. 179951 Zehavi 
et al. I2002I I is expected to decrease by ~ 15% at z ~ 1.0 (see 
e.g. the ACDM simulations by Kauffmann et al. 1999), thus cor- 
responding to ~ 3 Mpc. To search for the best integration 
radius n^ax, we measured w(rp) for the XMM-COSMOS refer- 
ence sample for different n^ax values ranging from 3 to 220 h^^ 
Mpc. In Fig. |5] (upper panel), we show the increase of w(rp) 
with the integration radius jT„ax at those projected scales where 
most of the clustering signal is coming from (r,, = 5 - 20 /i"' 
Mpc). The w{rp) values appear to converge for n„ax <: 40/1"' 
Mpc. Similarly, the amplitude of the spatial correlation function 
B = (Fig. |5] bottom panel) or the amplitude of the projected 
correlation function C = A(j)rQ (not shown) is converging for 
^nwx ^ 40 /i ' Mpc0The correlation length ro and slope y are 
strongly correlated: when ro increases, y decreases. The correla- 
tion length appears to reach a maximum at jT^ax ~ 80/;"' Mpc, 
while y is constant in the range jT„ax = 40 - 200 /i"' Mpc. Based 
on these considerations, we adopt jr^ax = 40 /i ' Mpc in the fol- 
lowing analysis, which is the minimum jTmax value at which the 
correlation function converges, and returns the smaller errors on 
the best-fit correlation parameters ro, y and r^. We note that with 
this choice ro is smaller by ~ 10% than the maximum value mea- 
sured at n„,„x ~ 80 /i"' Mpc, but we do not try to correct for this 
small bias. 

To measure ^(r^, n) we created random samples of sources in 
our fields and measured the excess of pairs at separations (r^, tt) 
with respect to the random distribution. We used the minimum 
variance estimator proposed by Landy & Szalay ( II993I) . which 
is found to have a nearly Poissonian variance and to outper- 
form other popular estimators, especially on large scales (e.g., 
see Kerscher et al. 2000): 



[DD] - 2[DR] + [RR] 
[RR] ' 



(7) 



where [DD], [DR] and [RR] are the normalized data-data, data- 
random and random-random pairs, i.e. 



\di 



(di + d2)tan- 



(5) 



Since w(rp) is an integral along the radial coordinate, it is in- 
dependent of peculiar velocity effects and can therefore be used 
as an estimate of the true, real-space correlation function. In par- 
ticular, if the real space correlation function can be approximated 
by a power law of the form ^(r) - (r/ro)"'' and 7T„iax = then 
the following relation holds (Peebles [l980l) : 



w(rp) = A(y)rlrp 



(6) 



where A(y) = r(l/2)r[(y- l)/2]/r(y/2) and r(x) is the Euler's 
gamma function. Then, A(y) increases from 3.05 when y = 2.0 
to 7.96 when y = 1.3. 



[DD]=DD(rp,7T) 
[DK\=DR(rp,n) 



nArir - 1) 
ndind - 1) 

(«.-!) 
ln,i 



[RR]=RR(rp,n), 



(8) 



(9) 



(10) 



where DD, DR, and RR are the number of data-data, data- 
random, and random-random pairs at separations r^ + Ar^ and 
7T + An, and n^ and are the total number of sources in the data 
and random sample, respectively. 



^ Since A(y) varies only by ~ 16% in the range of the measured 
slopes, B and C show almost exactly the same behaviour with K„,ax- 
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Fig. 6. Left panel: projected correlation function for the XMM-COSMOS AGN reference sample (538 objects with z = 0.2 - 3.0 
and Iab < 23, see Table 1). In this and in the following panel, errors are Icr Poisson confidence intervals and the best-fit power 
law is shown as a dashed line. Right panel: projected correlation function of the full sample computed including (open circles) and 
excluding (filled circle) the 42 objects in the large-scale structure at z ~ 0.36: it is evident that the z ~ 0.36 structure is largely 
responsible for the signal excess on scales ~ 5 - 15 Mpc. The projected correlation function for the z - 0.4 - 1.6 sample is 
also shown (filled triangles). For display purposes, the w{rp) datapoints of the different samples have been slightly shifted on the rp 
axis. 



Since both the redshift and the coordinate (a, 6) distributions 
of the identified sources are potentially affected by observational 
biases, special care has to be taken in creating the sample of 
random sources. This has been extensively discussed by Gilli 
et al. (I2005I I for the Chandra Msec Fields (see. e.g. their Sect. 
4.2) where similar problems have been encountered. In that pa- 
per we showed that extracting the coordinates of the random 
sources from the coordinate ensemble of the real sample and 
using the observed redshifts to create a smoothed redshift dis- 
tribution for the random sample is a sufficiently accurate proce- 
dure. As in Gilli et al. (2005), we assumed a Gaussian smoothing 
length cr, = 0.3, which is a good compromise between scales 
that are too small, which would suffer from local density vari- 
ations, and those that are too large, which would oversmooth 
the distribution. We nonetheless verified that our results do not 
change significantly when using a smoothing length in the range 
o", - 0.2 - 0.4. The smoothed redshift distribution adopted for 
our simulations is shown in Fig. HI The adopted procedure, if 
anything, would slightly reduce the correlation signal, since it 
removes the effects of angular clustering. Each random sample 
is built to contain more than 20000 objects. 



We binned the source pairs in intervals of Alog rp— 0.2 and 
measured w{rp) in each bin. The resulting datapoints were then 
fitted by a power law of the form given in Eq. |6] and the best- 
fit parameters y and ro were determined via minimization. 
Given the small number of pairs that fall into some bins (espe- 
cially on the smallest scales), we used the formulae of Gehrels 
( 1198 6) to estimate the 68% confidence interval (i.e. Icr errorbars 
in Gaussian statistics). 



4. Safety checks and error estimates 

A possible concern related to the analysis methods presented in 
the previous section is the random-sample generation. Indeed, 
placing the random sources at the coordinates of the real sources 
completely removes the contribution to the signal due to angular 
clustering. This procedure could therefore underestimate the true 
correlation length. 

We try to quantify this effect by considering a random sam- 
ple simulated according to the XMM-COSMOS sensitivity maps 
(see e.g Miyaji et al. 2007 and Cappelluti et al. 2007). Briefly, 
each simulated source is extracted from a reference input logN- 
logS, placed at random in the XMM-COSMOS field, and kept in 
the random sample if its flux is above the sensitivity map value 
at that position. It is evident that this method is producing a ran- 
dom sample that only accounts for the varying X-ray sensitivity 
along the COSMOS field, but does not account for the positional 
biases related to the optical follow-up program. The result of this 
test is that the measured correlation length increases by ~ 15% 
with respect to the former case. 

An additional test was performed prompted by the X-ray 
flux distribution of objects with spectroscopic redshift being dif- 
ferent from that of the total XMM sample. In particular, the 
fraction of objects with spectroscopic redshift Frac is constant 
(about 70%) for X-ray fluxes fo.5-2keV > 1.6 x lO"'"* erg cm"^ 
s"', while it decreases towards fainter fluxes, reaching 0.0 at 
fo.s-ikev < 7 X 10""'. Objects with spectroscopic redshifts may 
therefore undersample the regions of maximum X-ray sensitiv- 
ity, in which the X-ray source density is higher, producing a 
more regular distribution on the sky than the total XMM sam- 
ple. We therefore created a new random sample by first placing 
sources on the field according to the X-ray sensitivity map as dis- 
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cussed in the previous paragraph and then keeping only a frac- 
tion of them, with a flux-dependent "keeping" probability given 
by the observed relation Frac vs flux described above. When 
computing the projected correlation function using this new ran- 
dom sample, we find a result similar to what was obtained in the 
previous test, i.e. a ~ 16% higher ro value than obtained when 
placing random objects exactly at the coordinates of real objects. 

Again, this new random sample unfortunately also does not 
fully account for the positional biases related to the optical 
follow-up programs. Indeed, given the very complex optical 
follow-up that combines results from different programs, it is im- 
possible for us to estimate the correct selection function of our 
sample, but it is likely that the selection of the masks used for 
optical spectroscopy, which cover the COSMOS field unevenly, 
leaving some patches of the field poorly covered, while covering 
other patches rather extensively, is causing the main positional 
bias. This can be for instance appreciated in Fig. 1, in which 3 of 
the 4 inner circles representing Magellan IMACS pointings (16 
pointings in total) have a higher density of objects with spectro- 
scopic redshift (cyan dots) than all the remaining pointings. We 
therefore believe that the systematic upward shift of 15 - 16% in 
ro that we obtained with these tests is likely to be an upper limit. 
Also, we note that, when performing error analysis considering 
bootstrap errors (see next paragraph), a difference of 15 - 16% 
is within the total error budget. Given this limited difference, 
we are confident that our results are not strongly affected by the 
method used to generate the random source sample. 

While many of the correlation function estimators used in 
the literature have a variance substantially larger than Poisson 
(because source pairs in general are not independent, i.e. the 
same objects appear in more than one pair), the estimator used 
here was shown to have a nearly Poissonian variance (Landy 
& Szalav [19931 1. It has, however, to be noted that the Landy & 
Szalay ( 119931) estimator was originally tested in the approxima- 
tion of weak clustering, so that Poisson errorbars may in our case 
underestimate the true uncertainties. Bootstrap resampling has 
often been used to estimate the uncertainties in the correlation 
function best-fit parameters (e.g. Mo, Jing & Borner il99 2), but 
this technique may return an overestimate of the real uncertain- 
ties (Fisher et al. 1994). We tested bootstrap errors by randomly 
extracting 100 samples of 538 sources each from our total sam- 
ple, allowing for repetitions. The rms in the distribution of the 
best-fit correlation lengths and slopes is a factor of ~ 2.8 and 
~ 2 greater than the Poisson errorbars, respectively. In the fol- 
lowing we simply quote ro and y, together with their Icr Poisson 
errors, bearing in mind that the most likely uncertainty is about 
a factor of 2 higher. 

5. Results 

We first measured the projected correlation function w(rp) of 
all the 538 spectroscopically identified sources in our XMM- 
COSMOS reference sample (Jab < 23 and z - 0.2 - 3) regard- 
less of their optical classification. The correlation function was 
measured on projected scales r^ - 0.3 - 40 /i ' Mpc. Here and 
in the following samples, a simple power law is fit to the data, 
using standard minimization techniques to get the best-fit pa- 
rameters. The best-fit correlation length and slope are found to 
be ro = 8.65 ± 0.45 /i"' Mpc y = 1.88 + 0.07, respectively. 
Based on the error on ro from this two-parameter fit, we esti- 
mate the clustering signal to be detected at > 18o- level, which 
is the most significant clustering measurement to date for X-ray 
selected AGN. The measured correlation length appears to be in 
good agreement with what is estimated by Miyaji et al. (2007) on 



the first pass on XMM-COSMOS through angular clustering and 
Limber's inversion. As shown in Fig. |6^, a signal excess above 
the power-law fit is observed in the scale range r,, ~ 5 - 15 /i"' 
Mpc, which deserves further investigation. As shown in Fig. |4] 
a significant fraction of XMM-COSMOS sources are located 
within a large-scale structure at z ~ 0.36: overall, 42 objects 
are located at z = 0.34 - 0.38, with sky coordinates distributed 
all over the 2 deg 2 fieldQ We verified that, when sources in the 
z - 0.34 - 0.38 redshift range are excluded, the signal excess at 
rp ~ 5 - 15 /i ' Mpc actually disappear^ (see Fig.|6j3). This has 
the effect of reducing the correlation length to ro =6.1+ 0.8 (see 
Table 1). We also verified the eff'ects of restricting the redshift in- 
terval around the peak of the redshift distribution and computing 
w{rp) in that interval. For XMM-COSMOS AGN in the range 
z = 0.4 - 1.6, we found a correlation length of ro ~ 5.2 + l.O/j"' 
Mpc (see Table 1). Similar results are found when the redshift 
interval is restricted to z = 0.5 - 1.5. 

We are repeating our angular auto-correlation function anal- 
ysis using the full three-year data (55 pointings instead of the 
first-year 23 pointings used in Miyaji et al. 2007). We plan 
to use the accurate (w 2%) photometric redshifts presented by 
Salvato et al. (2008) to calculate the angular correlation function 
of XMM AGNs selected by redshift. Preliminary results show 
ro = 9.7 + 2.2/!"' Mpc for the z > sample, which is fully con- 
sistent with the measurement presented in Miyaji et al. (2007), 
while the correlation length is reduced to ro = 8.9 + 2.5/;"' Mpc 
(Icr errors) for the sample obtained by excluding objects in the 
redshift range z - 0.34-0.36. These values are consistent within 
the errors with those obtained with the present analysis, despite 
being systematically higher. The angular analysis results, how- 
ever, still vary with the angular scale range used for fitting, as 
well as with error estimation methods. A more detailed discus- 
sion of the comparison of the results presented in this paper with 
those obtained through the study of the angular auto-correlation 
function and cross-correlation function with galaxies will be pre- 
sented in a future paper (Miyaji et al., in prep). 

5. 1 . AGN clustering as a function of optical type 

It is interesting to investigate the projected correlation function 
for different source subsamples. For each subsample we placed 
the sources of the random sample only at the positions of the 
sources in that subsample. The projected correlation function 
was then fitted both leaving the slope free and fixing it to y = 1.8, 
the standard value measured in most galaxy samples, which is 
also similar to the slope of 1.88 measured for the total XMM- 
COSMOS sample. Fixing the slope allows a more direct compar- 
ison between the correlation lengths of the different subsamples 
when a two-parameter fit is poorly constrained. 

We investigated the clustering properties of sources optically 
classified as broad line AGN (BLAGN) or non-BLAGN. The 
class of non-BLAGN is admittedly a mixed bag, which may 
include obscured AGN, weak unobscured AGN, whose optical 
emission is diluted by the host galaxy light, and normal galaxies. 
The cut at z > 0.2 (roughly corresponding to Lo.5-2 > lO""'^ erg 
s"'), however, should guarantee that the non-BLAGN sample is 



"* Another ~ 25 X-ray selected objects belonging to this structure are 
found using photometric redshifts (see Fig.[3]and Salvato et al. 2008). 

' We note that, because of this excess due to the z ~ 0.36 structure, 
a simple power law is a very poor fit to the w(rp) of our full reference 
sample and of the z < 1 AGN subsample (see Sect. 5.3 and Table 1). In 
contrast, a simple power law provides statistically acceptable fits for all 
the other subsamples analyzed in this work. 
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Fig. 7. Projected correlation function for different XMM- 
COSMOS AGN subsamples. t//?/?er panel: broad line AGN 
(open circles) vs non-broad line AGN (filled circles). Middle 
panel: X-ray unabsorbed (HR < -0.2) AGN (open squares) vs 
X-ray absorbed {HR > -0.2) AGN (filled circles). Since the 
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after removing objects at z ~ 0.36 (open circles). Lower panel: 
AGN at z > 1 (open circles) vs AGN at z < 1 (filled circles). In 
all panels errors are 1 cr Poisson confidence intervals and the best 
fit power laws are shown as dashed lines. 



mostly populated by AGN in which the absence of broad optical 
lines is solely due to nuclear obscuration. Therefore, investigat- 
ing the clustering properties of BLAGN and non-BLAGN should 
be a proxy to investigate the clustering properties of unobscured 
vs obscured AGN. For the sample of 305 BLAGN, we measured 
a correlation correlation length of ro ~ 7.7 ± 0.9 /i"' Mpc and a 
slope of y ~ 2.0 + 0.2, while for the 229 non-BLAGN we mea- 
sured a similar correlation length, ro ~ 7.0 +1.0 /i"' Mpc, and 
a somewhat flatter slope y ~ 1.6 + 0.1. The projected correla- 
tion functions of BLAGN and non-BLAGN AGN are shown in 
Fig-U}( upper panel). It should be noted that a proper comparison 
between the clustering properties of BLAGN and non-BLAGN 
should take possible redshift effects into account, since BLAGN 
are generally observed at higher redshift than non-BLAGN (me- 
dian z - 1.5 vs z = 0.7, see Table 1). In principle, the correla- 
tion length of a given AGN and galaxy population is expected to 
change with redshift, being intimately related to the evolution of 



the hosting dark matter halos, and one should therefore compare 
source populations at the same redshift to establish whether they 
reside in the same environment or not. We will return to this in 
the discussion. 



5.2. AGN clustering as a function of X-ray absorption 

To investigate the clustering properties of obscured vs unob- 
scured AGN further, we also considered the column density 
measurements obtained from the spectral analysis of XMM data. 
We considered here the measurements performed by Mainieri et 
al. (2007 and in prep.), who performed X-ray spectral fits for 
those objects with more than 100 counts in the 0.3-10 keV band 
and found absorption in excess of the Galactic value in about 
25% of their sample (see details in Mainieri et al. 2007). We 
considered here only objects with more than 200 counts in the 
0.3-10 keV band, for which the determination of the column den- 
sity is more reliable. By matching the Mainieri et al. objects with 
more than 200 counts with our reference sample, we end up with 
290 objects for which a column density has been estimated: 70 of 
these do show absorption in excess of the galactic value. We note 
that 21 out of the 190 broad line AGN with measured column 
density have absorption in excess of the Galactic value, which 
is consistent with the 10% fraction of X-ray absorbed broad line 
AGN found in other X-ray selected samples (see e.g. Tozzi et 
al. 2006, Brusa et al. 2003). A few words of caution should, 
however, be spent on these sources. First, simulations run on in- 
put X-ray unabsorbed spectra show that, especially for sources 
at high redshift and relatively low photon statistics, the spectral 
fit may return spurious positive values for the absorption (see 
e.g. Tozzi et al. 2006). Second, the fraction of X-ray absorbed 
sources might be related to the significance threshold used to 
assess the presence of X-ray absorption. In particular, Mainieri 
et al. consider an X-ray source as absorbed if the addition of a 
photoelectric cut off in the spectral fit improves it a a level of 
more than 90% as assessed by an F-test. One would then expect 
that, in about 7 out of 70 absorbed sources (either BLAGN or 
non-BLAGN), the measured column density is spurious. At any 
rate, results do not change significantly if we include those 21 
candidate X-ray absorbed BLAGN in the total X-ray absorbed 
sample or not. 

We first verified that the projected correlation function of the 
290 objects with X-ray spectroscopy is consistent with that of 
our full reference sample and then tried to measure the projected 
correlation function for absorbed and unabsorbed sources sepa- 
rately. Unfortunately, the small number statistics prevent us from 
getting a significant clustering signal for the 70 X-ray absorbed 
AGN, while for the 220 X-ray unabsorbed objects we found a 
correlation length of ro = 10.6 ± 0.9, somewhat higher than that 
measured for the subsample of BLAGN. When restricting the 
analysis to the redshift range z=0.4-1.6, i.e. around the peak of 
the redshift distribution, which also excludes the redshift struc- 
ture at z=0.36, the correlation length of the X-ray unabsorbed 
objects decreases to ro - 6.2 + 1 .4, consistent with what is mea- 
sured for the full reference sample in the same redshift interval. 

To overcome the limitations due to the small size of the sam- 
ple of objects with reliable Nh measurements, we tried to cal- 
ibrate a relation between the X-ray column density Nh and the 
hardness ratio (HR), defined as the difference between the source 
X-ray photons detected in the 2-10 keV band and those detected 
in the 0.5-2 keV band, normalized to the sum of the photons 



10 R. Uilli et al.: I he spatial clustering ot AGN in the XMM-COSMOS held 

Table 1. Summary of best-fit clustering parameters. 



Sample 


M' 


z range 


-z" 


log^O.5-10 


yi iVipLJ 


7 


'-o(r= 1.8) 
yn ivipLj 


All AGN 

No z=0.36 spike 

z ~ 1 AGN 


538 
496 
349 


0.2-3.0 
0.2-3.0 
0.4-1.6 


0.98 
1.03 
0.94 


43.8 
43.9 
43.7 


6 12+"" 

" -0 89 
c 17+0.80 
•'■^ ' -1.14 


1 88+""* 
1 74+0.13 

-0 14 
1 79+0.17 
'-^ '"^-0.18 


6 32+°-5^ 

s 44+0.62 


BLAGN 


305 


0.2-3.0 


1.45 


44.3 


7.66+°-'*' 
7 03+^-^'^ 

'•"■-'-1.18 


1 98+" " 


7 03+°'* 

-0.89 


non-RT AGN 


229 


0.2-1.3 


0.70 


43.2 


^■°"-0.14 


7 7^+0.62 


LJD AT 

HK < — U.Z 

HR < -0 2 7 < 1 3 

/f^ < -0.2,no spike 


/no 
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Fig. 8. Left panel. Column density Nh vs hardness ratio distribution for the 290 objects in our reference sample with more than 200 
photons in the 0.5-10 keV band, for which an accurate Nh measurement can be performed through X-ray spectral fitting. Objects 
that do not show absorption have been plotted at logA^//=20.4, i.e. at the Galactic value. Broad line AGN and non-broad line AGN 
are shown as blue open and red filled symbols, respectively. Circles refer to objects detected in both soft and hard X-ray bands, 
for which an HR value is directly measured. Upward (downward) pointing triangles represent upper (lower) limits to the hardness 
ratios. It is evident that most objects above the dotted line at HR - -0.2 are absorbed by column densities logNn > 21.6. Right 
panel. Hardness ratio vs redshift plot for the 538 sources in our reference sample. Symbols are as in the left panel. The dotted Une 
at HR = -0.2 marks the adopted dividing fine between X-ray absorbed and unabsorbed AGN. 



in the two bandsQ The distribution of the column density vs 
hardness ratio for objects with more than 200 counts is shown 
in Fig. [8] (left panel): most of the objects with HR > -0.2 do 
show absorption in excess of logA^// = 21.6, therefore we adopt 
a rough threshold at HR - -0.2 to divide X-ray absorbed from 
X-ray unabsorbed AGN (see also Hasinger 2008 and Fig. 11 

* Upper (lower) limits to the hardness ratio for objects that are not 
detected in the soft (hard) band have been computed assigning 40 counts 
in the non-detection band to each source. The distribution of the counts 
of the detected sources indeed shows a turnover at this value, which 
therefore appears as the average count threshold for source detection. 



in Mainieri et al. 2007). The HR distribution as a function of 
redshift for the 538 objects in the reference sample is shown in 
Fig.Miright panel). Most BLAGN fall below the HR=-0.2 line, 
while non-BLAGN AGN do show higher HR values on average. 
The poorly populated upper-right corner of the figure, i.e. the 
high-HR - high-z region, suff'ers from obvious selection effects 
due to i) the bias against faint (distant and absorbed) magnitude 
targets in the optical spectroscopy follow-up (see also Fig. 4 in 
Brusa et al. 2007) and ii) the K-correction effects that make high- 
redshift absorbed spectra to appear softer in the X-ray bandpass 
(i.e. lower HR values). It is noted that BLAGN form a sort of 
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horizontal sequence at HR ~ -0.5, which is indeed the hard- 
ness ratio value expected by a canonical power law spectrum 
with photon index F = 1 .7 and no absorption. Since we consider 
all objects with HR < -0.2 as unabsorbed sources, the adopted 
cut conservatively accounts for any dispersion in the photon in- 
dex distribution of BLAGN. We measured w{rp) for absorbed 
and unabsorbed objects separately. At projected scales below 
rp ~ \ h^^ Mpc, absorbed and unabsorbed AGN are similarly 
correlated, while absorbed AGN appear less correlated on larger 
scales. This results in absorbed AGN formally having a lower 
correlation length (ro = 5.1 + 1.7 vs 9.6 ± 0.4 /i"' Mpc) and a 
steeper slope (y = 2.4+0.7 vs 2.0+0.1) than unabsorbed objects. 
The projected correlation function for unabsorbed AGN does 
not change significantly if we restrict the analysis to the redshift 
range 0.2-1.3, i.e. the same range as used for absorbed AGN (see 
Table 1). The projected correlation function of unabsorbed and 
absorbed AGN in the same z=0.2-1.3 redshift interval are shown 
in Fig.^{middle panel). The larger correlation length measured 
for unabsorbed objects is essentially due to most objects in the 
z=0.36 structure having HR < -0.2 (see Fig. [8] right panel). 
Indeed, when removing this structure, the correlation length for 
X-ray unabsorbed AGN decreases to ro - 6.7 ± 0.8 (see Fig. |7] 
middle panel), which, given the large errorbars, is not signifi- 
cantly different from that of X-ray absorbed objects. To summa- 
rize, from our analysis we cannot claim that X-ray absorbed and 
X-ray unabsorbed AGN possess different clustering properties. 

5.3. AGN clustering as a function of redshift 

The study of AGN clustering as a function of redshift provides 
several pieces of information about the formation and evolution 
of the AGN population. 

Because of the limited sample size, we simply split the 
XMM-COSMOS AGN sample in two subsamples of objects be- 
low and above redshift 1 . The correlation functions of the 276 
AGN at z < 1 and of the 262 AGN at z > 1 are shown in 
Fig. |7] lower panel. The best-fit correlation parameters for ob- 
jects at z < 1 are found to be ro ~ 8.0 ± 0.5 Mpc and 
y ~ 1.8 + 0.1, while for objects at z > 1 the best-fit parameters 
are ro ~ 6.7 + 1.6/z"' Mpc and y ~ 1.9 + 0.3. When removing the 
z - 0.36 structure, the correlation length of objects at z < 1 de- 
creases to ro ~ 5.2 + 0.8 Mpc (see Table 1). In Sect. 6.3. we 
discuss the correlation lengths of the various XMM-COSMOS 
redshift subsamples as compared to those of other optical and 
X-ray selected samples at diflFerent redshifts. 

5.4. AGN clustering as a function of luminosity 

We finally investigated the dependence of the AGN cluster- 
ing parameters on the X-ray luminosity, since this may reveal 
whether objects shining with different luminosities reside in dark 
matter halos with different masses, hence constraining the distri- 
bution of the AGN Eddington ratios (see eg. Lidz et al. 2006, 
Marulli et al. 2008 and the discussion in Sect. 6.3). Again, be- 
cause of the limited size of the sample, we simply divided it into 
two almost equally populated subsamples, the dividing line be- 
ing at Lo.5-wkeV = 10'*'* erg s"' (corresponding to Lo.5-2fcv ~ 
]^q43.5 gj-g g-1 fQj. ^ typical AGN X-ray spectrum). As shown in 
Fig. 12] splitting the sample at this luminosity is almost equal to 
splitting the sample at a redshift of z ~ 1 . Indeed, when comput- 
ing the clustering parameters of the higher (lower) luminosity 
sample, these are very similar to the z > 1 (z < 1) sample, and 
therefore are not reported here. 



6. Discussion 

6. 1 . Comparison with other z ~ 1 AGN and galaxy samples 

The galaxy and AGN census in the z ~ 1 Universe has been re- 
cently enlarged by a number of surveys with different areas and 
sensitivities, which allowed investigation of the spatial distribu- 
tion of different populations. The comparison between the clus- 
tering properties of AGN and galaxies allows to first approxima- 
tion to infer which galaxy population is hosting any given AGN 
population, under the simple hypothesis that AGN activity at a 
given redshift is randomly sampling the host galaxy population. 
The comparison between z ~ 1 AGN samples obtained from sur- 
veys with different sensitivities may also reveal any dependence 
of AGN clustering on luminosity. As far as X-ray selected AGN 
are concerned, a correlation length of ro = 5.7^j-^ Mpc has 
been measured for ~ 230 objects in the 0.4 deg^ CLASXS sur- 
vey (Yang et al. 2006). Objects in the CLASXS have redshift 
and luminosity distributions very similar to those of our sample, 
and therefore they should trace the AGN population sampled by 
XMM-COSMOS almost exactly. To check whether the different 
techniques used for the clustering analysis may introduce sig- 
nificant differences, we analyzed the CLASXS sample using the 
same techniques as were used in this work finding best-fit clus- 
tering parameters in very good agreement with the Yang et al. 
(2006) values. The difference between the ro values measured in 
XMM-COSMOS and in CLASXS full samples (8.6 vs 5.7 h'^ 
Mpc, respectively) therefore appears to be inherent to the two 
fields considered. While a prominent redshift spike is observed 
at z=0.36 in XMM-COSMOS, no such similar structures are 
found in the CLASXS field. Indeed, when removing the struc- 
ture at z = 0.36, the correlation length of XMM-COSMOS AGN 
decreases to ~ 6.3/1"' Mpc, in good agreement with the value 
measured in CLASXS. Moreover, when restricting the analysis 
to XMM-COSMOS AGN in the redshift range z = 0.4 - 1.6, 
the correlation length (~ 5.2 h^^ Mpc) is very similar to what is 
measured in CLASXS. One therefore may wonder about the fre- 
quency with which prominent large-scale structures are sampled 
in X-ray surveys of different sky areas, i.e. the effects of cosmic 
variance. Indeed, based on simulated galaxy mock catalogs over 
2 deg^ fields (Kitzbichler & White 2006), some evidence exists 
that the COSMOS field has some excess of structures with re- 
spect to the average. 

The correlation length of XMM-COSMOS AGN can be 
compared to that of different galaxy populations at z ~ 1 . Coil 
et al. (2004) find ro - 3.2 + 0.5 /z"' Mpc for emission line 
galaxies in the DEEP2 survey, while Meneux et al. (2006) find 
ro = 2.5 ± 0.4/1"' Mpc for star-forming blue galaxies in the 
Vimos-VLT Deep Survey (VVDS, Le Fevre et al. 2004). The 
populations of red absorption-line galaxies in the same surveys 
have instead larger correlation lengths: ro ~ 6.6/1"' Mpc for 
absorption line galaxies in the DEEP2 (Coil et al. 2004) and 
ro = 4.8 + 0.9 /i"' Mpc for red, early type galaxies in the VVDS 
(Meneux et al. 2006). Recently, a correlation length as large as 
ro = 5.1 ± 0.8 /j~' Mpc has been measured for luminous infrared 
galaxies (LIRG, L/s > 10" L©) at z ~ 0.8, which are forming 
stars at high rates {SFR > 11 yr"'; see Gilli et al. 2007b). 
Since at z ~ 1 star formation is closely related to galaxy mass 
(Noeske et al. 2007, Elbaz et al. 2007), even LIRGs, as well as 
z ~ 1 early type galaxies, are massive objects with stellar mass 
>3x 10'° Mq. The fact that XMM-COSMOS AGN show 
similar correlation length to these systems (see Fig.|9]), suggests 
that, similar to what is observed at z = (Kauffmann et al. 2004) 
and at z ~ 2 (Daddi et al. 2007), at z ~ 1 nuclear activity is 
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hosted by the more massive galaxies (see also Georgakakis et al. 
2007). This is in good agreement with the analysis by Silverman 
et al. (2008b), who investigated the occurrence of nuclear activ- 
ity on a sample of ~ 8000 galaxies selected from the zCOSMOS 
spectroscopic catalog, finding that the fraction of galaxies host- 
ing an AGN increases towards large stellar masses at z < 1 .0. In 
particular, most (~ 80%) AGN at z ~ 1 reside in galaxies with 
stellar mass M* > 3 x 10'" Mq, in agreement with the conclu- 
sions from our clustering analysis. 

6.2. The connection with dark matter halos 

While on small scales, comparable to the dimensions of dark 
matter halos, AGN and galaxy clustering are difficult to predict 
because of merging and interactions that can trigger a number of 
physical processes, on larger scales (e.g., > 1 Mpc), where 
interactions are rare, the AGN correlation function should follow 
that of the hosting dark matter halos. 

An interesting consequence is that one can estimate the 
masses of the typical halos hosting an AGN population by sim- 
ply comparing their clustering level. According to the standard 
ACDM hierarchical scenario, dark matter halos of different mass 
cluster differently, with the more massive halos more clustered 
for any given epoch, and it is straightforward to compute the 
correlation function for halos above a given mass threshold. It is 
worth noting that, since less massive halos are more abundant, 
the correlation function of halos above a given mass threshold 
is very similar to the clustering of halos with mass close to that 
threshold. Also, it is important to note that, as far as our mea- 
surements are concerned, the best-fit clustering parameters are 
obtained from datapoints mostly on large scales (rj, > 1 
Mpc; see Fig.|6]l. Therefore the measured ro and y values are es- 
sentially due to the clustering signal on large scales, where the 
AGN correlation function follows that of the dark matter, allow- 
ing a meaningful comparison with the clustering expected for 
dark matter halos. 

We considered the dark matter halo catalogs available for the 
Millennium simulatioi£|(Springel et al. 2005). Halo catalogs are 
available at different time steps along the simulation. Here we 
considered those at z ~ 1 (parameter stepnum=41 in the simu- 
lation). In total there are about 1.6 x 10^ halos with mass above 
10'" /?"' Mq in a cubic volume of 500 /i"' Mpc on a side. We 
computed the correlation function and the space density of halos 
above 7 mass thresholds ranging from log(M/M0)=lO.8 to 13.2 
in steps of 0.4. Here we use as halo mass estimator the simula- 
tion parameter m_Crit20O, defined as the mass within the radius 
where the integrated halo overdensity is 200 times the critical 
density of the simulation. The halo correlation length was esti- 
mated by fitting with a power law the halo correlation functions 
on scales above r - 1 /;"' Mpc. The results are shown in Fig.|9l 
where it is clear that more massive halos are more clustered and 
less numerous. 

We computed the space density of AGN in XMM-COSMOS 
as expected from published X-ray luminosity functions (see de- 
tails in Sect. 6.5) and compared the ro and density values of our 
population with those of other AGN and galaxy populations at 
z ~ 1 and with those of dark matter halos at z ~ 1 as computed 
above. We considered the XMM-COSMOS AGN in the redshift 
range z - 0.4 - 1 .6, i.e. around the peak of the selection function 
and excluding the redshift structure at z = 0.36. The comparison 
is shown in Fig.|9l 



^ see |http://www.mpa-garching .mpg .de/millenniiiml 



9 




0.0001 0.001 0.01 

space density [h^ Mpc"^] 

Fig. 9. Space density vs correlation length for the XMM- 
COSMOS AGN compared to that of other AGN and galaxy 
populations at z ~ 1, as labeled. The trend for the Millennium 
dark matter halos at z ~ I above different mass thresholds is 
also shown as a shaded region. More massive halos (log of the 
threshold mass is labeled) are less abundant and more clustered 
than less massive ones. The XMM-COSMOS AGN datapoint 
(big filled circle) refers to the z=0.4-l .6 sample. Solid and dotted 
errorbars correspond to Poissonian and bootstrap uncertainties, 
respectively. 



By comparing the halo and the galaxy ro values, XMM- 
COSMOS AGN appear to be hosted by halos with masses > 
2.5 X 10'^ Mq, similar to absorption line galaxies and LIRGs, 
which indeed show similar correlation lengths. However, while 
absorption line galaxies and LIRGs appear to be more abundant 
than the hosting halos (with an average of 2-4 such galaxies 
per halo), XMM-COSMOS AGN appear to be a factor of > 5 
less abundant than the hosting halos, suggesting that nuclear ac- 
tivity is present in about ~ 15 - 20% of halos of that mass. 
Considerations about the duty cycle and lifetimes of XMM- 
COSMOS AGN will be presented in Sect.l63] 

6.3. Evolution of AGN clustering 

To investigate the evolution of the AGN clustering properties 
with redshift we combined the results from XMM-COSMOS 
with recent findings from other X-ray and optical surveys. When 
necessary, the results were corrected to the cosmology adopted 
here. The values of the correlation lengths reported in this sec- 
tion were usually calculated by fixing y to 1.8, therefore allow- 
ing a consistent comparison. When different slopes were mea- 
sured/adopted, we discuss the case and verify the effects of as- 
suming y = 1.8. Only results from spatial clustering analysis are 
considered. 

As for the X-ray surveys we considered the results from the 
ROSAT NEP survey and from the Chandra Msec fields. In the 
NEP survey, Mullis et al. (2004) found a correlation length of 
ro ~ 7.4 +1.8 /j ' Mpc on scales of 5 - 60 ' Mpc for source 
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Fig. 10. Correlation length vs redshift for XMM-COSMOS 
AGN compared to that of other AGN samples. The big blue filled 
circle refers to the z=0.4-1.6 XMM-COSMOS sample. Big red 
filled circles refer to the z < 1 and z > 1 samples. The big open 
circle refers to the z < 1 sample when excluding the z=0.36 
spike. Solid and dotted errorbars correspond to Poissonian and 
bootstrap uncertainties, respectively. The filled square at z ~ 0.1 
refers to the AERQS QSO sample (Grazian et al. 2004). Filled 
circles at z ~ 0.2, 0.7, 0.9 refer to the NEP (MulHs et al. 2004), 
CDFS and CDFN samples (Gilli et al. 2005), respectively. Filled 
triangles refer to the CLASXS sample (see eg Yang et al. 2006). 
Open squares refer to optically selected QSOs in the 2QZ sam- 
ple (Croom et al. 2005). The solid curves show the correlation 
length vs redshift relations expected for dark matter halos in 
the Millennium simulation above different mass thresholds as 
labeled. 



pairs at a median redshift z — 0.22. In the CDFS and CDFN, the 
correlation length measured by Gilli et al. 120051 is ro - 10.3 + 
1.7 /z"' Mpc and ro = 5.5 + 0.6 Mpc, respectively. Although 
the best-fit slopes in the Chandra Msec fields are rather flat (y - 
1.3-1 .5), the best-fit correlation lengths increase by only ~ 15% 
if the slope is fixed to 1.8 (GilH et al. 20051). 

As for optically selected AGN, we considered the results of 
Croom et al. (2005) based on more than 20000 objects in the fi- 
nal catalog of the 2dF QSO Redshift Survey (2QZ), where the 
QSO correlation length is found to increase significantly from 
z ~ 1 to z ~ 2.5, and very flat slopes (y ~ 1.1-1.2) have been 
measured. As discussed by Croom et al. (120051) . these flat slopes 
stem from redshift-space distortions that are relevant when the 
correlation function is measured down to small scales in redshift 
rather than in real space. The real-space clustering for the to- 
tal 2QZ sample has instead been measured by Da Angela et al. 
(120051 ) via the projected correlation function. On the same scales 
as considered by Croom et al. (|2005 ) and when approximating 
f (r) by a single power law, they found that, while the slope of the 
total 2QZ sample steepens significantly from y = 1.20 ± 0.10 to 
y - 1.85 + 0.13, the correlation length only marginally decreases 
by 10% (from ro = 5.5 + 0.5 /j"' Mpc to ro = 5.0 + 0.5 /z"' 



Mpc). In the following we therefore simply consider the val- 
ues as measured by Croom et al. (2005) for the 2QZ correlation 
lengths in different redshift bins. These results are consistent 
with those obtained by Porciani et al. (2004) using 2dF QSOs 
in a narrower redshift range. In the local Universe (z ~ 0.07), 
the clustering of bright optical QSOs (B < 15 mag) has been 
recently determined by Grazian et al. (120041) by means of the 
Asiago-ESO/RASS QSO survey (AERQS). These authors mea- 
sured ro = 8.6 + 2.0 /i"' Mpc at a median redshift of z ~ 0.1 on 
comoving scales 1 -30/i"' Mpc by fixing the correlation slope to 
7=1 .56. Given the above considerations for the Chandra Msec 
fields and the 2QZ and given the rather large uncertainties we 
have to deal with, we consider the value quoted by Grazian et al. 
(2004) as if obtained by fixing y to 1.8. All the measurements 
discussed above are shown in Fig.fTOl 

Unfortunately, it is not possible to perform a completely un- 
biased comparison between the various samples because differ- 
ent redshifts generally sample different luminosities, and AGN 
clustering may be a function of AGN luminosity if the latter cor- 
relates with the mass of the hosting dark halo (e.g. Kauffmann 
& Haehnelti2002i). 

In Fig. [lOl sources at z < 0.3 and z > 2 appear to be the 
most clustered ones, and these also correspond to the most lu- 
minous AGN. At z < 0.3 the median 0.5-10 keV luminosity of 
the AGN in the AERQS and NEP samples is about logL^ - 44.4 
(see MulHs et al. 2004 and Gifli et al. I2005t . At z > 2 the me- 
dian absolute B-band luminosity of the 2QZ QSOs corresponds 
to a median 0.5-10 keV luminosity of logL^ - 44.7 (assuming 
a standard QSO SED, e.g. Elvis et al. 1994). The less clustered 
sources are found at z ~ 1, but these have lower luminosities 
(iogLx = 43 - 44). In general, a clear dependence of cluster- 
ing amplitude on AGN luminosity has not been observed yet. 
On the contrary, the available evidence, if any, points towards a 
similar clustering for sources at the same redshift but with dif- 
ferent luminosities. Croom et al. (2005) and Porciani & Norberg 
(2006) could not find any significant evidence of luminosity de- 
pendent clustering in the 2QZ. From a cross-correlation analysis 
between galaxies and AGN, Adelberger & Steidel (2005) claim 
that AGN at z ~ 2 cluster similarly within a 10 mag luminos- 
ity range. From a theoretical point of view, one would expect 
little clustering dependence on the observed AGN luminosity if 
this is not directly related to the host halo mass; i.e., if, at any 
given redshift, objects that reside in halos within a narrow mass 
range have very different luminosities (eg. Lidz et al. 2006). For 
instance, even assuming a dependence of black hole (and host 
galaxy) mass on the hosting halo mass, a wide spread in the 
distribution in the Eddington ratios would make BH of similar 
masses radiate at very different luminosities. Indeed, although 
the average Eddington ratio of SDSS QSOs has been shown to 
increase towards high luminosities, the spread in the distribu- 
tion is wide (McLure & Dunlop 2004). Moreover, it has been 
recently suggested (Gavignaud et al. 2008) that the dispersion in 
the black hole mass-luminosity increases even more for lower 
AGN luminosities. Finally, in the local Universe, Constantin & 
Vogeley (2006) find that low-luminosity LINERs are more clus- 
tered than higher luminosity Seyfert galaxies, showing that the 
relation between AGN luminosity and clustering may even be re- 
versed for low-luminosity AGN with respect to the expectations 
based on a monotonically increasing relation between luminos- 
ity and black hole mass. Large statistical samples, beyond the 
reach of the data presented in this work, are needed to firmly 
establish any dependence of AGN clustering on luminosity (see 
eg. Porciani & Norberg 2006). 
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Fig. 11. Expected redshift evolution of the bias and correlation 
length of different XMM-COSMOS AGN samples according to 
a conserving scenario. The evolution curves are normalized to 
the observed datapoints. Blue filled circle: z - 0.4 - 1.6 sample. 
Red circles: z < 1 and z > 1 samples. The open circle shows 
the result for the z < 1 sample when the z = 0.36 structure is 
excluded. SoUd and dotted errorbars correspond to Poissonian 
and bootstrap uncertainties, respectively. 



Overall, when removing the redshift structure at z=0.36, the 
clustering of XMM-COSMOS AGN appear in good agreement 
with what is measured for optical and X-ray selected AGN at 
different redshifts. A larger correlation length is instead found 
for objects at z < 1 if the redshift structure is not removed. 

To interpret our clustering measurements at different red- 
shifts, we considered the halo catalogs in the Millennium sim- 
ulation and computed their correlation function above differ- 
ent halo mass thresholds and at different redshifts. We essen- 
tially repeated the computation presented in Sect. 6.2 for halos 
at z = 0.1, 0.5, 1.0, 1.5, 2.0, and 3.0. The ro vs redshift curves for 
halos above different mass thresholds are shown in Fig.[TO] Both 
our z ~ 1 and z > 1 AGN samples seem to be hosted by halos 
with mass above logiM/M^) > 12.4. For XMM-COSMOS AGN 
at z < 1 the minimum mass of the host halos varies from 12.4 to 
12.8 depending on whether the z=0.36 structure is excluded or 
included from the computation of ro. 

6.4. Descendants of z ~ I AGN 

As shown in the previous sections, under simple assumptions, it 
is possible to use the spatial clustering of an extragalactic source 
population measured at a given epoch to estimate the typical 
dark matter halos in which these objects reside. Furthermore, 
it is also possible to estimate their past and future history by fol- 
lowing the halo evolution in the cosmological density field. A 
useful quantity for such analyses is the bias factor, defined as 
b\r,z,M) = ^A(r,z,M)/^m(r,z), where ^A(r,z,M) and ^„,(r,z) 
are the correlation function of the considered AGN or galaxy 
population and that of dark matter, respectively. In general the 



bias parameter can be a function of scale r, redshift z, and object 
mass M. For simplicity we adopt the following definition here: 

bHz)^^A(8,z)/U8,z) (11) 

in which ^^(8,z) and ^,„(8,z), are the galaxy and dark matter 
correlation functions evaluated at 8 /i ' Mpc, respectively. The 
AGN correlation function has been measured directly in this 
work, while the dark matter correlation function can be esti- 
mated using the following relation (e.g., Peebles 1980): 

U^,z)^a-l(z)/J2 (12) 

where J2 = 72/[(3 - y)(4 - y)(6 - r)2>'], and (tI(z) is the dark 
matter mass variance in spheres of 8 /i"' Mpc comoving ra- 
dius, which evolves as <ts,{z) = crg,(Q)D(z). Also, D(z) is the 
linear growth factor of perturbations, while crg(O) is the rms 
dark matter fluctuation at the present time, which we fix to 
erg - 0.8 in agreement with the recent results from WMAP3 
(Spergel et al. 2007)0 While in an Einstein - De Sitter cosmol- 
ogy the linear growth of perturbations is simply described by 
DEdsiz) - (1 + z)"', the growth of perturbations is slower in a 
A-dominated cosmology. We consider here the so-called growth 
suppression factor g{z) - D{z)l DEdsiz) as approximated analyt- 
ically by Carroll, Press & Turner (1992). The resulting bias for 
the AGN in the XMM-COSMOS field is b(\.Q) = 2.0 + 0.2. 

Once the bias of the AGN population at its median redshift 
has been estimated using the above relations, it is possible to fol- 
low the time evolution of the bias with models that rely on sim- 
ple assumptions. Two popular scenarios that encompass two ex- 
treme hypotheses are the conserving model, in which objects do 
not merge at all (Nusser & Davis 1994, Fry 1996) and the merg- 
ing model in which objects merge continuously (e.g. Moscardini 
et al. 1998). In the first hypothesis the number of objects is con- 
served in time, and galaxies behave as test particles whose spa- 
tial distribution simply evolves with time under the gravitational 
pull of growing dark matter structures. In the second hypothesis, 
object merging follows the continuous merging of the hosting 
dark matter halos, in such a way that only those objects and ha- 
los which have just merged are observable at any given epoch. 

From an observational point of view, the fraction of galax- 
ies in mergers appears to be a very debated issue. Recent works 
suggest that close galaxy pairs (merger candidates) are a strong 
function of redshift, evolving as (1 -i-z)"'""* (Kartaltepe et al. 2007, 
Kampzycket al. 2007). By extrapolating the current estimates, at 
Z ~ 2 about 50% of luminous galaxies are expected to be found 
in close pairs/mergers. However, at z ~ 1 the fraction of galaxies 
in close pairs is still ~ 8% and decreases to ~ 0.1% at z ~ 0.1. 
In the following we will consider the non merging conserving 
model as a fairly adequate representation of the bias evolution of 
z ~ 1 XMM-COSMOS AGN towards lower redshifts; i.e., it will 
be used to estimate the Hkely descendants of XMM-COSMOS 
AGN. On the contrary, since merging is expected to be signif- 
icant towards higher redshift, we will not try to estimate their 
high-z progenitors. 

In the galaxy conserving model, the bias evolution can be 
approximated by 

b{z)^l + [b{Q)-\]ID{z) (13) 

where b{Q) is the population bias at z = (Nusser & Davis 1994, 
Fry 1996, Moscardini et al. 1998). Once biz) is determined, the 

Only small differences arise if we assume erg = 0.9 as in the 
Millennium simulation. The other relevant cosmological parameters as- 
sumed in this work are the same as in the Millennium simulation. 
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evolution of ^a(8,z) and hence of ro(z) can be obtained by in- 
verting Eq. [TT] A value of y ~ 1.8 for the slope is assumed in 
the above relations. Little difference arises when using y - 1.9. 

In Fig. [TT] we show the evolution of b(z) and r()(z) for vari- 
ous XMM-COSMOS AGN samples, including the z = 0.4 - 1.6 
sample, the sample at z > 1 and the sample at z < 1 with or with- 
out the structure at z = 0.36. By z = the correlation length of 
XMM-COSMOS AGN should evolve to ro > 6/i ' Mpc, which 
is typical of passive, early type galaxies in the local Universe 
(Colless et al. 2001, Zehavi et al. 2004). The correlation slope of 
the local early type population y ~ 1.8 - 2.0 also appears con- 
sistent with that of XMM-COSMOS AGN at z ~ 1. In principle, 
the evolution curves shown in Fig. [TT] can also be used to pre- 
dict which ro value a given XMM-COSMOS subsample should 
have as a function of redshift, allowing a proper comparison be- 
tween measurements obtained at different redshifts. Indeed, the 
correlation length of AGN in the z > 1 sample (median z ~ 1.5) 
is expected to evolve to ro ~ 7.5 Mpc by z ~ 0.7, i.e. the 
median redshift of the z < 1 sample, whose correlation length 
has been measured as varying between 8.0 and 5.2 /i"' Mpc, de- 
pending on the inclusion of the z = 0.36 structure. Given this 
uncertainty and the large errorbars in Fig. [TT] it is still difficult 
to claim that objects at redshift greater or smaller than 1 are sam- 
pling different environments. 

6.5. Estimating tine AGN iifetime 

Under simple assumptions it is possible to put limits on the AGN 
lifetime at any given redshift. Following Martini & Weinberg 
(1200 1 ). we assumed that the AGN in our sample reside within 
halos above a given mass threshold and that each halo hosts at 
most one active AGN at a time. The AGN Ufetime Iq can then 
be estimated with the following relation: 



1>(z) 



f 



dM- 



tH(M,Z) 



n(M,z), 



(14) 



where <l)(z) is the comoving space density of AGN above a given 
luminosity, M„„„ the minimum mass of the halos hosting an 
AGN, n{M, z) the comoving space density of halos of mass M 
at redshift z, and tH{M,z) is the lifetime of halos of mass M at 
redshift z. 

The definition of halo lifetime is somewhat ambiguous since 
halos are continuously accreting matter. Martini & Weinberg 
( 1200 11 1 defined tH{M,z) as the median time interval for a halo 
of mass M to be incorporated into a halo of mass 2M and used 
the extended Press-Schechter formalism to calculate it. To a first 
approximation f//(M, z) ~ tu(z), where tuiz) is the Hubble time 
at redshift z. With these approximations Eq.[T4]can be rewritten 



fe(z) = f„(z) 



(15) 



where (^h{z) - dM n(M, z) is the comoving space density 
of halo with mass above Mm,„. 

Since M„„„ is known from the comparison between the halo 
and the AGN correlation length, it is straightforward to esti- 
mate <1>h(z) from the number of halos with M > M„,i„ within 
the Millennium simulation box. For halos with M > 2.5 x 
lO'^ h'^ Mo, where XMM-COSMOS AGN at z ~ 1 reside, 
the space density is Oh(z) - 10"^ Mpc"^ (see Fig.|9]l. For the 
cosmology adopted here the Hubble time at z = 1 is ~ 6.3 Gyr. 



The comoving space density of XMM-COSMOS AGN has 
been estimated by considering literature X-ray luminosity func- 
tion of AGN selected in the 2-10 keV band, which should there- 
fore include unobscured, as well as moderately obscured, objects 
as the objects populating our sample. Once accounting for band 
effectfl the median luminosity of ourz ~ 1 sample (logL=43.7 
in the 0.5-10 keV band; see Table 1) translates into a 2-10 keV 
luminosity of logL~ 43.5. At these luminosities, z ~ 1 AGN 
in the La Franca et al. (2005) XLF have a space density of 
~ 3 X lO"'* Mpc"-'. A similar value for the AGN density is ob- 
tained when using the XLF by Ueda et al. (2003). Therefore, by 
considering an AGN density of ~ 1 .8 x 10""* Mpc"^ (obtained 
by rescaling the La Franca et al. space density by the fraction of 
objects with Iab < 23, as is the case for our selection), a duty 
cycle tQ/tn of 0.18 is obtained, which translates into an AGN 
lifetime of ~ 1.1 Gyr. 

This estimated lifetime is more than one order of magnitude 
longer than that estimated by Porciani et al. (2004) for bright 
optical QSOs at z ~ 1 in the 2QZ survey. The difference in the 
measured lifetime is essentially due to the difference between 
the space density of XMM-COSMOS AGN and 2QZ QSOs at 
z ~ 1 (~ 2 X lO""* vs K IQ-^ Mpc"^ see Table 1 in Porciani 
et al. 2004). Such a difference is, on the other hand, expected 
given the relatively bright limiting magnitude (wg ~ 20.8) of 
the 2QZ sample that, in addition, does not include obscured 
AGN. The estimated Ufetime for XMM-COSMOS AGN is sig- 
nificantly shorter than the ~ 8 Gyr time span between z = 1 
and z = 0. This, in combination with the estimate that XMM- 
COSMOS AGN will cluster with ro = 8 at z = 0, depicts a 
consistent scenario in which XMM-COSMOS AGN will switch 
off by z = 0, leaving relic (dormant) supermassive black holes in 
local elliptical galaxies. 



7. Conclusions 

We have studied the clustering properties of 538 moderately 
luminous AGN at z = 0.2 - 3 in the 2 deg^ COSMOS field, 
selected in the X-rays and spectroscopically identified to 
Iab < 23. Our main results can be summarized as follows: 

1. The projected correlation function w{rp) on scales 
rp = 0.3 -40 /;"' Mpc can be approximated by a power law with 
correlation length ro - 8.6 ± 0.5 /i"' and slope y = 1.9 + 0.1 
(Poisson errors; bootstrap errors are a factor of ~ 2 larger). This 
represents the most significant measurement of clustering of 
X-ray selected AGN to date. 

2. Part of the signal, in particular an excess on projected 
scales rp - 5 - 15 /z"' Mpc, is due to a large scale structure 
at z = 0.36. When excluding this structure or computing w(rp) 
for objects in a narrower redshift interval around z ~ 1, the 
correlation length decreases to ro = 5-6 /;"' Mpc, similar to 
what is observed in large samples of optically selected QSOs at 
the same redshift. 

3. Objects with different absorption properties do not show 
significant evidence for different clustering properties. Broad 
line AGN are consistent with inhabiting the same environments 
of non-broad line AGN. Similar results are obtained when 
considering X-ray absorbed and X-ray unabsorbed AGN. 



an X-ray photon index of F = 1.9 is assumed 
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4. No significant difference is found in the clustering 
properties of objects at redshifts below or above 1 . 

5. The con-elation length measured for XMM-COSMOS 
AGN at z ~ 1 is similar to that of early type galaxies and lumi- 
nous infrared galaxies at the same redshift. This, in agreement 
with other studies, suggests that z ~ 1 moderately luminous 
AGN are found preferentially in massive (M > 3 x 10'° Mq) 
galaxies. 

6. By using public halo catalogs from the Millennium 
simulation, we estimated XMM-COSMOS AGN to reside 
within dark matter halos of mass M > 2.5 X 10'- h-^ Mq. 

7. According to a simple conserving scenario for clustering 
evolution, the relics of z ~ 1 AGN are expected to be hosted by 
local bright L ~ Li, ellipticals by z-Q. 

8. By combining the number density of XMM-COSMOS 
AGN with that of the hosting dark matter halos, we estimated an 
AGN duty cycle of 0.1, which translates into an AGN lifetime 
of ~ 1 Gyr The estimated lifetime is more than one order of 
magnitude longer than estimated for optically bright QSOs at 
the same redshift. This is mainly due to the higher number 
density of AGN found in X-ray selected samples. 
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